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Abstract 



We apply the concept of scaling to the study of some statistical 
properties of financial data, showing the link between scaling behav- 
ior on one hand and correlation properties on the other hand. A 
non-parametric approach is used to study the scale dependence of 
the empirical distribution of the price increments (x(t + T) — x(t)) 
of S&P Index futures, for time scales ranging from a few minutes to 
a few days using high-frequency price data. We show that while the 
variance increases linearly with the timescale, the kurtosis exhibits 
anomalous scaling properties, indicating a departure from the iid hy- 
pothesis. Study of the dependence structure of the increments shows 
that although the autocorrelation function decays rapidly to zero in a 
few minutes, the correlation of their squares exhibits a slow power law 
decay with exponent a ~ 0.37, indicating persistence in the scale of 
fluctuations. We establish a link between the scaling behavior and the 
dependence structure of the increments : in particular it is shown that 
the anomalous scaling of kurtosis may be explained by " long memory" 
properties of the square of the increments. 

Nous etudions les proprietes statistiques des fluctuations des prix 
boursiers en utilisant le concept de lois d'echelle, en mettant en valeur 
le lien entre lois d'echelle et correlations temporelles des increments 
de prix. Utilisant des donnees haute-frequence nous etudions, avec 
une approche non-parametrique, la maniere dont la distribution Pt 
des increments de prix varie avec l'echelle de temps T a travers la 
dependance de sa variance et sa kurtosis par rapport a l'echelle de 
temps T, pour T allant de quelques minutes a plusieurs jours. Alors 
que la variance croit lineairement avec l'echelle de temps, le kurtosis 
manifeste une decroissance plus lente que celle d'une marche aleatoire 
a increments independants, indiquant la presence d'une dependance 
non-lineaire dans les increments, ce que confirme l'etude de fonctions 
de correlation non-lineaires, revelant une autocorrelation positive du 
carre des fluctuations decroissant en loi de puissance avec un exposant 
a ~ 0.37. Nous mettons egalement en evidence le lien entre cette 
persistance du carre des increments d'une part et la decroissance lente 
de la kurtosis avec l'echelle de temps d'autre part. 
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1 Motivation 



The distributional properties of asset price increments are important both 
from a theoretical point of view, for understanding market dynamics, and for 
numerous applications, such as the pricing of derivative products and Value- 
at-Risk estimations, in which distributional assumptions play a crucial role. 
These applications, and many others, involve different time horizons, ranging 
from a few minutes, the typical timescale on which market transactions take 
place, to several months which is the time horizon portfolio managers are 
concerned with. This requires knowing the distributional properties of price 
increments at various time scales. 

The problem of comparing the distributions of price changes at various 
time scales is also interesting from a fundamental point of view. Indeed, 
as we explain below, whereas studying the distribution of increments on 
a single timescale cannot distinguish between a process with independent 
increments and one with dependent increments, studying the deformation 
of the distribution under a change of time scale provides insight into the 
dependence structure of the time series. We will further elaborate on this 
point in section 5. 

Many studies have been conducted on the distributional properties of 
asset prices and returns (for a review see ]15| ) . However most of these studies 
focus on the distributional properties of returns for a given value of the 
timescale r, using daily, weekly or monthly data for example. 

Mandelbrot was the first to emphasize the idea of comparing the distri- 
butional properties of price changes at different time scales 0. The idea 
behind Mandelbrot's approach is that of scale invariance: the distribution 
Pt of price changes on a time scale T may be obtained from that of a shorter 
time scale r < T by an appropriate rescaling of the variable: 



where H is the self-similarity exponent. The simplest scenario of a scale 
invariant price process is that of a Levy flight a random walk with iid incre- 

2 Most studies focus on the statistical properties of returns, defined as (x(t + T) — x(t)) 
where x(t) is the logarithm of the price. In this study however, we have chosen to focus 
instead on the properties of price increments : x(t) is taken to be the price of the asset at 
date t. 
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ments having a stable Levy distribution [0], [Tj. This was indeed the solution 
proposed by Mandelbrot ||. In this case, if the increments have a //-stable 
Levy distribution then the process is self-similar with exponent H = l/^u. 

In the recent years the availability of high frequency financial data has 
rendered possible the study of price dynamics over a wider range of time 
scales. Several studies have been done in a spirit similar to that of Mandelbrot 
H in the quest of self-similarity in financial time series, on the S&P 500 index 
T^j and on the CAC40 (Paris stock index) ||18|| . This scale invariant behavior, 



observed for short time scales (up to 1000 minutes in breaks down for 
longer time scales [0. Explanations for these observations have been given 
in terms of truncated Levy flight models@, |i~3|j . 

While inspired by the above approaches, the present study has a different 
aim: we argue that the existence of non-linear correlations and anomalous 
scaling indicate that simple random walk models may be insufficient for mod- 
eling finer aspects of price fluctuations. Without assuming any scale invari- 
ance or self-similarity property, we attempt to characterize the deformation 
of the probability density function Pt under a change of time scale T by 
studying the scaling behavior of its variance and its kurtosis, showing that 
their scaling properties deviate from that of an iid random walk. 

Knowledge of the scale dependence of distributional properties of price 
changes enables us to extract more information on the dynamics of series 
than studying it on a single time scale, providing information about the 
dependence structure of the increments : in particular, we show that a link 
can be established between the scaling behavior of the cumulants and high- 
order correlation functions of the time series. 



2 The data set 

The study has been conducted on a high frequency data set of S&P Index 
future prices between October 1991 and September 1995. There are four 
maturity dates each year for S&P futures contracts and market activity is 
most intense for the contract closest to its maturity. We have constituted our 
data set by recording for each period the price of the S&P futures contract 
which has the greatest liquidity among all contracts traded in the market, 
using the above criterion: x{t) is the price of the (unique) futures contract 
maturing less than three months from t. The procedure of sticking together 
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various price series may artificially introduce price jumps at the maturity 
dates. The presence of these jumps may in turn influence various statistical 
estimates such as the variance or kurtosis. To avoid this, we have shifted all 
prices in the next data set by a constant so as to equalize the last price of a 
given set with the first price of the next one, thus eliminating these jumps. 
Prices of adjacent contracts are shifted by a constant value such that there is 
no jump between adjacent prices at maturity dates. Since our study focuses 
on statistical properties of price differences, shifting adjacent prices by the 
same constant has no effect whatsoever on our results. 

Prices are recorded at 5 minute intervals, the price retained being the 
last price recorded during the corresponding 5-minute interval. The data set 
thus constituted contains more than 75 000 prices. Throughout the article, 
time is given in multiples of r = 5 minutes ( T = Nr). N = 84 corresponds 
to a working day, N=420 to a regular working week. 

The size of our data set enables us to study timescales ranging from 5 
minutes (N=l) up to a few days ( N= 200). Estimates for the longest time 
scale (N=200) thus correspond to averages over a sample of size > 3500, 
enabling large sample approximations to be used. 



3 Probability density function of the incre- 
ments 

Let K be a smooth, positive function with the property 

/+oo 
K{x)dx = 1 
-00 

If (Xj) n >j>i is a sample realization of a random variable X the kernel 
estimator ]17] defined by 

1 n r — X- 

gives a smooth estimator of the probability density function (PDF) of the 
variable X. We have calculated a kernel estimator for the PDF of the distri- 
bution of the 5 minute price changes using a Gaussian kernel i.e. K(x) = 
cxp {-x /2) ^ s ] lown j n Figure 1. The non- normal character of the distribution 
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is better illustrated when it is compared to a Gaussian curve with same mean 
and variance, also shown in Figure 1. 

This impression is confirmed upon calculating the excess kurtosis K 5min 
of the distribution (see section 4): k — 15.95. This leptokurtic character of 
price changes is a constant feature of high frequency data. 

Another feature of high frequency data is that drift effects are negligible 
compared to fluctuations i.e. the ratio of the mean to the standard deviation 
is very small. This ratio is found to be m/a ~ 3.3 1CT 3 in our case: price 
variations due to drift are roughly a thousand times smaller than those due 
to 'volatility' - fluctuations around the mean value m. Therefore for practical 
purposes the distribution may be considered as centered around zero as long 
as the time scales T considered are such that mN <C a^/N i.e. N <^ N = 
(a/mf = 9000 ~ 3 months. 



4 Scaling properties 

We calculate for each time scale T = Nr the following quantities: 



a{Tf = {( x (t + T)-x{t))-{x{t + T)-x{t))Y 

[( x (t + T)-x(t))-(x(t + T)-x(tW o 

where the symbol (.) denotes a sample average: 

A=-±A(t) 

a(T) is an estimator of the variance of the distribution P T and k(T) an 
estimator of the kurtosis, the kurtosis of a random variable X being defined 
as: 

E[(X - E{X)Y] 
X E[(X - E(X)) 2 } 2 

(where E denotes expectation value) defined such that k x = for a 
gaussian random variable. 
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Going from a shorter time scale r to a longer time scale Nt formally 
corresponds to summing N random variables: 



N 

x(t + Nt) - x(t) = Sx j 
i=i 

If increments are stationary and independent, the distributions for various 
timescales are simply related by a convolution relation: 

P Nt = P T ® P T ® P T 

where ® denotes convolution. One can then deduce simply from this re- 
lation that for a stochastic process with iid increments with finite variance, 
the variance of the distribution at scale T = Nt increases linearly with iV 
and that the kurtosis decreases as 1/N 0. More generally if Ck(N) is the 
k-th semi-invariant or cumulant [171, of the distribution Py = Pnt then the 
normalized cumulants 

MNr) - CtiNT) - A ' (T) 



a{NT) k N k / 2 - 1 

tend to zero for large N, which is another way of seeing the Central Limit 
Theorem: the cumulants of order > 3 of the limit distribution are all zero, 
a property which characterizes the Gaussian distribution. For example the 
skewness A 3 (iW) decreases as 1/y/N. Note that A 4 (iVr) is the kurtosis k(Nt) 
of the distribution Pn t - 

There is no a priori reason to believe that the price of an asset has inde- 
pendent increments. However, it is now known that the independence of 
increments is not a necessary condition for the Central Limit Theorem to 
apply HJ: some types of dependence structures still allow for convergence 
to the Gaussian distribution as T —>■ oo in which case the distribution will 
resemble more closely a Gaussian at longer time scales than at shorter ones 
and Afc(T) —>■ as T —>■ oo. However there is no reason anymore for the 
normalized cumulants Afc(A^r) to decrease as 1/N k ^ 2 ~ 1 as in the iid case: the 
presence of 'non-trivial' scale dependence, which we term anomalous scaling, 
is the signature of a departure from the iid case. 

We have examined the empirical behavior of a(Nr) and k(Nt): Figure 
2. illustrates the scaling behavior of the variance; it can clearly be seen that 
the variance is linear in the timescale i.e. cx(A^r) = No-(t), as it would be for 
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independent increments. This property illustrates the the additivity of the 
variances of increments, which is equivalent to the absence of autocorrelation, 
discussed below. 

However if we examine the scaling behavior for higher cumulants of the 
distribution Pt we notice that their behavior is different than for an iid 
random walk. Figure 3 shows the behavior of the kurtosis under a change 
of timescale. As indicated above, in the case of iid increments, the kurtosis 
decreases by a factor 1/N when we multiply the time scale by iV : k[Nt) = 
k(t)/N. In Figure 3 we have given a power-law fit of the kurtosis k(Nt) as 
a function of the time scale T = Nr. We find that k(Nt) decreases more 
slowly than 1/N: 

k(Nt) ~ K(r)/N a 

where a ~ 0.5. 

In principle one could calculate higher order cumulants of the distribution 
and examine their scaling properties. However, the error bar for such calcu- 
lations increases drastically as we go higher in the order of the cumulants. 



5 Correlation and dependence 

It is a well-known fact that price movements in liquid markets do not ex- 
hibit any significant autocorrelation: the autocorrelation function of the price 
changes 



_ Sx t 5x t+ T - Sx t Sx t+T 
var(5x t ) 

rapidly decays to zero in a few minutes: for T > 15 minutes it can be 
safely assumed to be zero within estimation errors The absence of eco- 
nomically significant linear correlations in price increments and asset returns 
has been widely documented (see |T5| and references within) and often cited 



as support for the "Efficient Market Hypothesis" ||. The fast decay of the 
correlation function implies the additivity of variances: for uncorrelated vari- 
ables, the variance of the sum is the sum of the variances. The absence of 
linear correlation is thus consistent with the linear increase of the variance 
with respect to time scale. 
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However, the absence of serial correlation does not imply the indepen- 
dence of the increments: it is well known that two random variables may 
have zero correlation yet not be independent. In the same way, one may 
give examples of stochastic processes with uncorrelated but not independent 
increments. One example, given in || is the following: let X be a random 
variable uniformly distributed on [0, it] and define the discrete time stochas- 
tic process (U t )t>o by the recurrence relation U t = U t -i + sin(tX). A simple 
calculation then shows that the increments of (U t )t>o are uncorrelated; how- 
ever, they are not independent: indeed, they are deterministic functions of 
the same random variable, X. 

The simplest non-parametric test of the iid hypothesis is the sign test: if 
the increments have the same median value m then the number T of values 
exceeding m in a sample of size n has expected value n/2 and the statistic 
s = 2T J^ L has a standard normal distribution for large samples. The median 
m is taken to be the sample median, zero in this case. 

We calculate the sign statistic 

2T-n 



n 

for tick data, 5 minute price changes and 30 min price changes. The 
interval between ticks is irregular and depends on market activity but it is 
less than one minute. The results are the following: 

Stick — 6.66 , S§ m i n = —2.1 , Szomin = 1.25 

Assuming that the sign statistic has a N(0,1) distribution and using a confi- 
dence interval of two standard deviations, the results show that although the 
signs of price movements at very short time resolutions (less than a minute) 
are significantly correlated, they can be safely considered as independent (as 
far as the sign test is concerned) for time scales beyond 30 minutes. Even 
for 5 minutes s is close to 2 standard deviations. 

From a formal point of view, independence is characterized by the fact 
that for any (measurable) functions f,g the quantity 

C f , g (X, Y) = E[f(X)g(Y)] - E[f{X)]E[g{Y)] 

vanishes i.e. the variables f{X) and g(Y) are uncorrelated for any non- 
linear functions / and g. In particular, Cfj(X,Y) = : applying any non- 
linear function to an independent sequence gives an uncorrelated sequence. 
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In our case this means that increments are stationary and independent if 
and only if for any function / the series (f(x(t + T) — x(t))) t >o is serially 
uncorrelated. This criterion indicates a simple non-parametric way of testing 
the hypothesis of iid increments: for a given function / we calculate 

r m f{Sx t )f{Sx t+T ) - f{Sx t ) f{Sx t+T ) 
Uf{ } varf(5x t ) 

Formally, Cf(T) corresponds to a resummation of a certain subclass of 
correlation functions of the increments (Sx t )t>o, with weights corresponding 
to the Taylor coefficients of /. We use the following non-linear test functions: 
fi(x) = x 2 ,f'2{x) = cos(x) and fs(x) = ln(l + x 2 ). In each case we calcu- 
late the autocorrelation of the series (f(5xt))t>o- The results are displayed 
in Figure 5. It can be seen that the series (fi(Sx t ))t>o and (f3(Sx t ))t>o ex- 
hibit slowly decaying serial correlations, showing the presence of nonlinear 
dependence in the data. 

To obtain a more detailed picture of the dependence structure of the 
increments, we proceed to calculate higher order correlation functions. It 
is well known that the square of the returns -or any other measure of the 
amplitude of fluctuations for example the absolute value of the increments- 
exhibits significant autocorrelation. 

Figure 6 compares the autocorrelation of the price changes to that of their 
absolute value. In contrast to the autocorrelation function of the increments 
which decays rapidly to zero in a few minutes, the autocorrelation of their 
absolute value decays slowly to zero while staying positive, indicating persis- 
tence in the scale of the fluctuations, a phenomenon which can be related to 
the well known " clustering of volatility" . 

Another measure of the scale of the fluctuations os given by the square 
of the increments; Figure 7 displays the autocorrelation function g(T) of the 
square of the increments, defined as: 



i aj\ — — ^ X t^ x t+T dx 2 Sx 2 +T _ 5x 2 5x 2 +T 5x 2 Sxf +T 
9{ } ~ 9K t'~ var{Sx 2 ) ~ /x 4 (r) - <j(t) 4 U 

Fitting g(T) by an exponential gives very bad results; however the slow 
decay g(T) is well represented by a power law: 
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g(k) ~ |^ a = 0.37 ± 0.037 g = 0.08 

the exponent a and the constant go being obtained by a regression of 
In g(T) against T. We will see in the following section that this approximation 
is precise enough to finely capture the scaling behavior of the kurtosis. 

6 Relation between scaling behavior and cor- 
relation structure 

The results of the two preceding sections may be blended into a consistent 
picture of the dynamical properties of the price process by remarking that 
the scaling behavior of the semi-invariants of the increments is related to 
their correlation structure. 

There is a simple, well-known example of relations between correlation 
functions and scaling behavior: the relation between the variance of a sum 
and the covariance of the addends. The variance at time scale Nr may be 
expressed as 

N 

a\Nr) = $> 2 (T) + 2a 2 (r)EC((^-0T) 

k=l k>l 
N N 

k=l k=l 

N k 

= iVa 2 (r)[l + 2£(l--K^r)] 

k=l 

where C(T) is the autocorrelation function of the increments, defined 
above, therefore implies <j 2 (Nt) = Na 2 (r). The absence of autocorrelation 
implies linear scaling of the variance. 

A similar relation may be derived between the scaling behavior of the kur- 
tosis and the autocorrelation of the square of the increments (the function g 
defined above) ||L6| . Using the results above, we model the signs of the succes- 
sive price changes as an independent sequence with zero mean, independent 
from their absolute value. Using these hypotheses the fourth moment may 
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be expressed in terms of the moments and correlation functions of shorter 
time scale increments 8x; the calculation, details of which are given in the 
appendix, gives: 

/i 4 (Nt) = Nfj, 4 (r) + 3(N 2 + N)a(r) 4 + 

N N 

6N(^ - a{rf) £ g(k) - 6(^ 4 - a(r) 4 ) £ kg(k) 

k=l k=l 

The scaling behavior of the kurtosis is therefore related to the behavior 
of the correlation function g; if g exhibits a slow power law decay g(k) ~ |£ 
then Ef=i g(k) ^ go/ (I - a)N Y - a and E£Li kg(k) ~ y /(2 - a)N 2 ' a so: 

^(iVr) 
a 4 (iVr) 

«(r) 6(«(r) + 2) 

AT (2-a)(l -a)iV a 

Figure 8. shows good agreement for large values of N (N > 50) between 
the right hand side of the above equation and the kurtosis as a function of 
time resolution, plotted on the same diagram. We have thus shown that the 
anomalous scaling of the kurtosis is well accounted for by the presence of 
correlations in the scale of the fluctuations, represented by the square of the 
increments. 

7 Conclusion 

By comparing the statistical properties of price increments of S&P index fu- 
tures at various time scales, we have shown that studying the resolution de- 
pendence or scaling behavior of the variance and the kurtosis of the empirical 
distribution of price increments enables us to extract much more information 
than studies on a given time scale. In particular, the scaling properties of 
the kurtosis of the price changes may be recovered from the correlation func- 
tion of their squares, which exhibits a slow power law decay with exponent 
a = 0.37 ±0.037. Such relations may be generalized to other cumulants and 



k{Nt) = 
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moments but which are less interesting from an economic point of view. 
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Appendix : relation between scaling behavior of kurtosis and 
correlation of increment squares 

Let Sxk = x(t + kr) —x(t+ (k — 1)t). We derive here the relation between 
the kurtosis and the correlation function of the square of the increments 
g(k) defined in equation (1). Let us furthermore suppose without loss of 
generality that E[Sxk = 0] (see also the remark in section 3). The fourth 
moment /i^Nt) is given by 

Ha(Nt) = E[(x(t + Nt) - x{t) f] 

N 

k=i 

N 

= E[ SxiSxjSxkSxi] 

i,j,k,l=l 

N N 
i=l i>j i>j>k=l 

+3y y j E\SxjSxk] + E[SxiSxjSxkSxi] 

i>k i>j>k>l 

We will suppose now that the increments may be written as 

8x k = e fc 7fc e k G {+1, -1}, j k > 

where (e^) is a sequence of independent sign variables, but the sequence 
(7^) may have an arbitrary dependence structure. Since 5x k is centered 
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Ee k = 0. This hypothesis captures well the behavior of the increments: their 
magnitudes are correlated ("ARCH effect") but their signs random. We 
furthermore suppose that 7fc is independent from ej. As a result, most of the 
terms appearing in the above sum vanish: 

E[8x\8x k ] = E^e^k] 

= E[e t ]E[tk]-E[lhk] = 

E[8xi8xj8x k 8xi] = E[e i ]E[e j ]E[e k ]E[e l ]E['y i 'Y j 'y k 'y l ] = 

E[6^SxjSx k ] = E[e j ]E[e k ]E[>yf>y j >y k ] = 

The expression for the fourth moment therefore reduces to: 

N 

i=l i>j 
N N 

= E E[8x%] + 6 £(iV - j)E[8x 2 M +j r\ 
i=i j=i 

From (1), E[8xj8x^\ may now be expressed in terms of the correlation 
function g: 

E[8xl8x) T \ = (jn - a(rf) g(\i - j\) + <j(t) 4 
hence, by substitution, the relation between /i 4 and g: 

^(Nr) = N^(r)+6j2(N-k)[a(T) 4 + (^-a(r) 4 )g(k)} 

k=l 

= N fH (T)+3(N 2 -N)a(T) 4 + 

N N 

= 6N(vu(t) - <t(t) 4 ) £ g(k) - 6(^ 4 (r) - <x(t) 4 ) £ kg(k) 

k=l fc=l 

The kurtosis for a time resolution T = Nt is therefore given by: 
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N^(r) 1 q _i_ p.( n (~\ f ~ %(*) 



In our case g(T) ~ ^ which implies, for k(T): 



^ k(t) %o(.(t) + 2) /v 2 - iv 2 - 

«(r) + 6g„( fJ (r)+2) 



AT (2-a)(l -a)iV a 
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FIGURE CAPTIONS: 



Figure 1: Probability density of 5 minute increments of S&P 500 index future 
prices. The lower curve is a gaussian with same mean and variance. 

Figure 2: Scaling behavior of the variance of price increments: the variance 
increases approximately linearly with the time scale, which is consistent with 
the absence of significant linear correlations. 

Figure 3: Scaling behavior of the kurtosis of price increments. The kurtosis 
decreases much more slowly than 1/N: a power law fit gives an exponent 
close to 0.5 . 

Figure 4: Autocorrelation function of price changes. Time lag is given in 
multiples of r = 5 minutes. 

Figure 5: Behavior of some non-linear correlation functions of price changes. 

Figure 6: Autocorrelation function of absolute price changes: contrary to the 
price changes, their absolute values display a persistent character. 

Figure 7: Autocorrelation function of the square of price changes. Note the 
slow decay, well represented by a power law with exponent a = 0.37. 

Figure 8: The scale dependence of the kurtosis (data points) may be recon- 
structed (solid line) from the autocorrelation function of the square of the 
price increments, approximated by a power law (Eq.l). 
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